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Abstract  -  Fusion  entails  both  the  association  and 
synthesis  of  information.  If  missassociations  occur, 
they  obviously  undermine  the  gains  won  by  synthe¬ 
sis,  compromising  the  fusion  product.  An  analytic 
framework  is  presented  here  to  study  the  competi¬ 
tion  between  the  negative  effect  of  missassociation 
and  the  positive  effect  of  synthesis,  to  demonstrate 
and  analyze  their  interplay  quantitatively.  Here  the 
quality  of  information  being  enhanced  or  degraded 
is  taken  to  be  the  extent  to  which  the  information 
correctly  determines  a  decision  or  action  inference. 
To  say  that  the  uncertainty  injected  by  missassocia¬ 
tion  may  overwhelm  the  uncertainty  reduction  won 
by  synthesis,  for  instance,  would  mean  that  this 
inference- determining  quality  of  information  falls 
in  fusion  below  that  of  the  best  information  source 
working  independently.  This  is  ultimately  a  study 
in  uncertainty  dynamics:  the  beneficial  reduction  of 
uncertainty  by  synthesis  in  fusion  versus  the  detri¬ 
mental  increase  of  uncertainty  due  to  association, 
which  are  both  always  present  in  fusion  to  some  de¬ 
gree. 

Keywords:  Data  association,  missassociation,  informa¬ 
tion  quality,  fusion  performance,  fusion  theory. 

1  Introduction 

Fusion  can  be  represented  schematically  as  in  Fig.  (1). 
The  arrows  indicate  the  flow  of  information,  from  var¬ 
ious  sources  coming  together  to  produce  another  in¬ 
formation  product  on  the  right,  the  output  presum¬ 
ably  more  useful  in  some  important  respect.  The  fu¬ 
sion  occurs  inside  the  box,  which  could  represent  the 
human  mind,  an  expert  system,  a  combined  human- 
machine  system,  and  so  forth.  Beginning  from  this 
simple  schematic,  we  might  ask  if  there  are  principles 
governing  the  dynamics  of  the  quality  of  information  in 
fusion.  Is  it  true,  for  instance,  that  fusion  necessarily 
improves  maters?  Are  there  conditions,  for  instance, 
under  which  improvement  might  for  some  reason  be 
impossible? 

Fusion  entails  association  and  synthesis — the  asso¬ 
ciation  first  of  all  of  subject  matter  between  the  infor¬ 
mation  sources  to  be  fused,  and  then  the  synthesis  of 
the  information  pertaining  to  that  subject  matter  [1], 
Synthesis  is  often  the  focus  of  discussion  in  fusion.  It 
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Figure  1:  A  conceptual  schematic  of  fusion.  Associa¬ 
tion  is  the  correlation  of  subject  matter  treated  by  all 
incoming  information  sources.  Synthesis  is  the  combi¬ 
nation  of  that  information  into  a  single  fusion  product. 


is  what  adds  value  to  information.  If  it  were  a  mat¬ 
ter  of  synthesis  alone,  then  the  prospects  for  fusion  (in 
theory  at  least)  are  always  good.  More  independent 
information  generally  means  better  information  in  to¬ 
tal.  What  must  not  be  forgotten,  however,  is  that  is 
there  can  be  no  fusion  without  association. 

Illustrating  briefly,  imagine  a  surveillance  aircraft  on 
a  routine  mission  overflying  and  photographing  a  ship 
at  sea.  The  crew  on  board  the  aircraft  hail  the  ship  to 
collect  routine  information  about  destination,  purpose, 
and  so  forth,  which  in  this  case  apparently  contradicts 
the  heading  of  the  ship  and  the  equipment  evident  on 
its  decks.  The  crew  in  the  air  grow  suspicious,  alert 
surface  forces  for  interception,  and  so  forth,  not  real¬ 
izing,  however,  that  their  visual  contact  was  with  one 
ship,  while  their  radio  contact  was  with  another  very 
distant  ship.  The  fusion  between  independent  visual 
and  radio  information  took  place  in  the  mind  of  the 
crew  on  the  implicit  assumption  that  there  was  just 
one  subject,  whereas,  in  reality  there  were  two.  It  is  a 
simple  error  of  association.  Elsewhere  a  single  subject 
might  be  treated  as  several.  When  associating  informa¬ 
tion  between  several  databases  for  instance,  using  the 
name  of  a  person  or  a  ship  to  relate  records,  the  name 
may  be  entered  differently  in  any  number  of  places,  in 
any  number  of  ways,  by  typing  errors,  changes  in  con¬ 
vention,  omissions,  which  make  the  one  subject  appear 
as  several  independent  subjects. 

There  are  of  course  ways  to  avoid  these  kinds  of  as¬ 
sociation  errors.  The  points  to  be  made  nonetheless 
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are  1)  that  when  dealing  with  situations  of  increasing 
complexity,  the  uncertainty  about  correct  association, 
or  the  probability  of  missassociation,  tends  as  a  rule 
to  increase;  and  2)  that  the  more  independent  the  in¬ 
coming  sources  of  information  are,  the  more  potential 
they  hold  for  gains  through  synthesis,  but  the  more 
uncertainly  they  are  also  associated  together.  Thus 
the  system  developer  is  likely  at  some  stage  to  face  di¬ 
minishing  returns.  Fusion  may  actually  fail  to  improve 
matters,  and  may  even  significantly  degrade  matters. 
Much  as  synthesis  holds  only  positive  potential  of  im¬ 
proving  information,  association  holds  only  negative 
potential  for  degrading  it,  and  both  are  always  at  work 
to  some  degree  in  fusion.  Our  question  for  fusion  may 
therefore  be  restated  this  way:  To  what  extent  will 
the  probabilities  of  missassociation  undermine  the  im¬ 
provements  by  synthesis? 

This  paper  addresses  the  question  analytically,  in  a 
generic  way,  as  a  step  (it  is  hoped)  toward  develop¬ 
ing  engineering  rules  of  thumb  by  which  to  judge  the 
prospects  for  fusion,  by  analysis,  before  a  fusion  system 
is  built  and  fielded,  and  to  help  identify  and  diagnose 
problems  with  fusion  in  operation. 

That  fusion  entails  association  is  of  course  nothing 
new.  But  in  the  analysis  of  fusion  it  is  often  either  ex¬ 
plicitly  asserted  or  implicitly  assumed  that  the  subject 
matter  treated  by  all  of  the  incoming  sources  of  in¬ 
formation  can  be  unambiguously  associated;  that  two 
independent  subjects  have  not  mistakenly  been  con¬ 
fused  as  one,  or  that  one  subject  has  not  been  con¬ 
fused  as  two  or  more.  What  has  been  missing  is  the 
admission  that  association  errors  can  occur  in  prac¬ 
tice,  with  a  subsequent  quantitative  analysis  of  what 
their  affect  is  expected  to  be.  In  some  cases  associa¬ 
tion  uncertainties  have  been  ignored  because  the  ap¬ 
plication  is  of  such  a  tightly  controlled  or  consistent 
nature  that  an  unambiguous  correspondence  between 
the  subject  matter  treated  by  each  information  source 
was  assumed  to  be  assured  by  design.  This  is  gener¬ 
ally  (though  not  strictly)  true  of  the  biological  exam¬ 
ples  often  cited  in  the  early  fusion  literature,  such  as 
the  fusion  of  sound  and  sight  in  animals  hunting  prey; 
the  multiple  sensors  in  these  cases  being  co-located, 
synchronous,  and  correlated  by  a  brain  evolved  for  the 
purpose.  Association  uncertainties  have  also  been  ig¬ 
nored  because  the  information  sources  were  assumed 
(often  mistakenly  in  practice)  to  be  totally  error  and 
ambiguity  free.  The  information  in  data  bases,  for 
instance,  such  as  in  a  daily  report  of  harbour  vessel 
traffic  and  the  historic  record  of  vessel  traffic,  might 
treated  as  if  uncontaminated  by  entry  errors,  spelling 
mistakes,  omissions  or  duplications,  synonymic  varia¬ 
tions,  ontological  ambiguities,  and  so  forth,  whereas, 
what  was  one  day  entered  as  a  “fishing  boat”  might  on 
other  days  been  entered  as  a  “trawler” ,  ship  names  and 
codes  may  have  been  entered  with  spelling  errors,  and 
so  forth,  all  of  which  undermines  associations  made  be¬ 
tween  data  bases,  and  even  within  the  same  data  base. 
Elsewhere  association  uncertainties  might  be  ignored 
if  their  effects  would  be  so  obviously  dangerous  that 
operation  under  any  shadow  of  association  uncertainty 


would  be  unthinkable.  One  either  operates  with  full 
confidence  in  association,  or  one  does  not  operate  at 

all. 

1.1  Scope 

A  conceptual  model  of  fusion  is  proposed  here  that  il¬ 
lustrates  the  competition  of  forces  quantitatively,  for  a 
particular  kind  of  fusion,  when  the  fusion  product  is  in¬ 
tended  to  determine  a  particular  decision/ action  infer¬ 
ence  with  more  reliability  than  any  of  the  fused  sources 
of  information  working  independently  could  determine 
the  same  inference.  This  would  include,  for  instance, 
the  fusion  of  various  target  detection  and  classification 
sensors,  the  fusion  of  decisions  of  various  experts  (hu¬ 
man  or  expert  system)  on  one  given  question,  or  the 
fusion  of  various  data  bases  to  detect  anomalous  ac¬ 
tivity  for  counter  terrorism  or  maritime  security,  this 
would  not  include  fusion  for  (continuous)  parameter  es¬ 
timation,  such  as  the  refinement  by  multi-sensor  fusion 
of  a  target’s  range  and  bearing,  or  fusion  that  merely 
increases  coverage,  such  as  the  side-by-side  presenta¬ 
tion  of  information  from  sensors  viewing  independent 
but  neighboring  regions  of  the  earth. 

By  decision/action  inference  is  meant  that  the  pur¬ 
pose  of  information  gathering  and  fusion  is  to  support 
making  decisions  or  taking  action.  It  may  be  the  deci¬ 
sion  to  change  one’s  stance  or  attitude  toward  a  radar 
contact,  to  commit  resources  to  further  information 
gathering,  to  continue  with  one’s  present  actions  with¬ 
out  change,  to  issue  a  command  or  raise  an  alarm, 
and  so  forth.  In  every  case  an  agent,  human  or  ma¬ 
chine,  considers  information  in  light  of  some  decision 
to  be  made,  or  action  to  be  taken,  that  is  intimately 
related  to  that  agent’s  situation  awareness.  The  util¬ 
ity  of  the  information  then  depends  above  all  else  on 
the  reliability  with  which  the  inference  is  made.  It 
also  depends  very  strongly  on  the  nature  of  the  ap¬ 
plication.  The  analysis  therefore  begins  by  relating 
utility  measures  to  the  performance  probabilities  for 
inference,  with  the  performance  probabilities  serving 
as  generic  surrogate  metrics  for  more  specialized  util¬ 
ities.  Unlike  information  theory,  then,  we  are  asking 
about  the  utility  of  information,  not  about  its  quantity 
in  bits,  which  correlates  poorly  with  its  ultimate  utility 
in  human  terms.  Information  is  good  or  bad  insofar  as 
it  produces  good  or  bad  decisions  or  actions.  The  ques¬ 
tion  posed  earlier  for  fusion  may  therefore  be  restated 
more  precisely  again:  To  what  extent  will  a  probability 
of  missassociation  in  fused  information  undermine  the 
decision/ action  inferences  based  on  that  information? 

For  simplicity  in  the  final  model,  the  decision/action 
inference  is  assumed  to  be,  in  effect,  a  choice  between 
just  two  predefined  options,  labeled  0  and  1,  as  in  tar¬ 
get  detection  and  classification,  for  example,  when  an 
object  in  a  scene  that  is  under  surveillance  by  sev¬ 
eral  sensors  is  classified  as  a  target  (1)  or  clutter  (0). 
But  any  two-option  inference  is  possible.  And  just  two 
sources  of  information  are  considered  for  fusion.  The 
approach  is  generic  inasmuch  as  the  way  in  which  deci¬ 
sion/action  inferences  are  made,  by  human  or  automa¬ 
tion,  and  the  way  in  which,  or  algorithm  by  which, 
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the  fusion  is  carried  out,  are  for  the  most  part  of  no 
consequence.  Performance  and  utility  are  the  key  pa¬ 
rameters. 


2  Utility  of  information 

The  quality  of  information  lies  ultimately  in  the  qual¬ 
ity  of  the  decision/action  inference  that  might  be  made 
from  it,  which  is  the  probability  that  the  inference 
is  correctly  made.  This  is  the  conditional  probabil¬ 
ity  Pc  that,  given  a  world  in  state  m,  the  appropriate 
resource-affecting  inference  m  is  made  about  that  state 
of  the  world,  on  the  basis  of  the  information  available 
amid  uncertainty, 

P, p  =  P  (  infer  m  |  given  state  m) .  (1) 

Pp  is  not  necessarily  a  measure  of  utility  itself,  but 
it  is  a  driver  behind  any  utility  metric  U  when  being 
correct  in  decision  and  action  is  paramount.  Whatever 
U  may  be,  that  is  to  say,  it  will  be  a  monotonic  function 
of  the  conditional  probabilities  Pp, 

dU 

U  =  U  (P^,)  and  f)pc  —  0  f°r  ah  m->  (2) 

assuming  that  higher  U  means  higher  utility.  U  might 
be  the  probability  of  being  correct  during  a  mission 

M-l 

U  =  pmP%,  (3) 

m= 0 

for  example,  in  which  a  possibly  relevant  state  of  the 
world  m  is  expected  to  appear  with  relative  frequency 
pm,  and  there  are  M  such  states.  Or  U  might  be  the  ex¬ 
pected  benefit  (negative  expected  cost)  of  operations. 
There  are  of  course  other  dimensions  to  the  utility  of 
information,  like  timeliness,  distribution,  and  access, 
but  here  we  are  interested  in  the  information  that  hap¬ 
pens  to  be  available  at  the  time  a  decision/action  infer¬ 
ence  is  made,  not  in  the  information  that  might  have 
been  available. 


2.1  Performance  probabilities 

Performance  probabilities  for  two-option  inferences  can 
be  modeled  very  generally  as  a  thresholded  decision 
variable.  That  is  to  say,  given  one  of  two  mutually 
exclusive  states  of  the  world,  labeled  m  =  0  or  1,  the 
expected  (average)  performance  of  a  human  or  a  ma¬ 
chine  making  an  inference  amid  uncertainty,  /  =  0  or 
1,  on  the  basis  of  their  information  about  which  state 
holds,  can  generally  be  modeled  as  if  /  were  deter¬ 
mined  by  the  value  of  a  decision  variable  c  relative  to 
a  threshold  Ct, 


f  0  if  c  <  ct, 
(  I  if  c  >  ct, 


(4) 


where  c  is  a  random  variable  drawn  from  a  probabil¬ 
ity  distribution  conditioned  by  the  true  state  of  the 
world,  as  shown  in  Fig. (2).  This  kind  of  model  is  used 
in  physchophysics  for  human  behavior  [2],  for  instance, 


prob. 


Figure  2:  The  performance  of  an  inference  agent  is 
modeled  as  a  thresholded  random  decision  variable  c 
whose  distribution  is  conditioned  by  the  state  of  the 
world,  of  which  there  are  only  two  of  interest  in  this 
case,  m  =  0  and  1.  Greater  separation  d  between  the 
distributions  means  better  quality  inferences. 


without  claiming  that  c  or  ct  exist  in  reality,  and  with¬ 
out  making  any  assertions  about  the  process  by  which 
the  assertion  was  made,  c  and  ct  are  simply  a  means 
for  modeling  the  average  (expected)  performance  or 
reliability  of  inferences  made  amid  uncertainty,  with¬ 
out  predicting  or  explaining  individual  inferences.  The 
model  in  Fig.  (2)  is  used  in  much  the  same  way  here, 
making  no  claims  about  what  the  information  may  be, 
no  claims  about  the  process  or  algorithm  by  which  the 
decision-action  inference  is  made,  and  no  claims  about 
what  a  particular  individual  inference  will  be.  The 
model  therefore  applies  very  generally  to  many  differ¬ 
ent  situations. 

In  the  example  below,  the  density  functions  in 
Fig.  (2)  are  for  simplicity  taken  to  be  Gaussian  of  unit 
variance,  with  separation  d  between  means.  Increas¬ 
ing  the  separation  implies  that  either  the  information 
or  the  inference  agent,  or  both,  have  been  improved. 
In  this  paper  it  is  the  quality  of  information  that  is 
at  issue,  particularly  as  the  probability  of  missassoci- 
ation  is  increased  in  fusion,  so  the  intrinsic  ability  of 
any  inference  agent  is  assumed  to  be  constant. 

Figure  (3)  shows  the  range  of  performance  probabil¬ 
ities  Pq  and  Pi  that  might  result  for  a  given  informa¬ 
tion  source  as  the  inference  agent’s  aversion  to  making 
one  or  another  type  of  error  changes,  which  is  to  say, 
in  terms  of  the  model  in  Fig.  (2),  as  the  threshold  ct 
is  varied  along  the  c  axis.  Each  point  (Pp  ,  Pp)  on 
a  curve  in  Fig.  (3)  corresponds  to  a  different  value  of 
Ct  for  a  given  quality  of  information.  This  paramet¬ 
ric  representation  of  performance  is  much  the  same  as 
a  receiver  operator  characteristic  (ROC)  curve  in  de¬ 
tection  or  classification,  but  with  the  horizontal  axis 
reversed;  an  ROC  curve  typically  having  the  proba¬ 
bility  of  false  alarm  Pfa  plotted  along  the  horizontal 
axis  rather  than  Pp  =  1  —  Pfa •  A  uniform  treatment 
of  all  Pp  as  in  Fig.  (3)  simplifies  the  analysis  when 
multi-option  inferences  [m  =  0, 1, 2 . . .  M  —  1)  are  con¬ 
sidered,  when  the  parametric  curve  becomes  a  multi¬ 
dimensional  surface. 
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Figure  3:  The  performance  probabilities  P(f  and  Pf 
are  plotted  here  as  the  threshold  ct  is  varied.  The 
lighter  curve  signifies  an  inference  of  lower  utility  be¬ 
cause  for  equal  Pq  the  darker  curve  has  a  higher  Pf . 

2.2  Comparing  information  quality 

Given  N  information  sources  labeled  s  =  1,2 , N, 
one  would  have  have  N  different  models  of  the  type  in 
Fig. (2),  each  with  its  own  decision  variable,  threshold, 
and  probability  density  functions,  and  performance 
curve  in  Fig.  (3).  In  general,  the  form  of  the  utility 
function  U  would  have  to  be  known  in  order  to  rank 
all  of  these  from  best  to  worst  in  terms  of  their  ulti¬ 
mate  utility  for  the  same  application.  But  there  are 
conditions  owing  to  monotonicity  (2)  under  which  the 
best  or  worst  source  can  be  identified,  regardless  of 
what  U  may  be.  A  source  s  =  t,  for  instance,  whose 
performance  probabilities  P£  t  (with  second  subscript 
for  source  label)  are  maximum  in  the  following  respect 

Pm,t.  —  Pm  s  f°r  all  m  and  s  t,  (5) 

will  be  the  best  source  of  information,  offering  the  high¬ 
est  utility  in  decision  or  action.  On  the  other  hand,  a 
source  whose  performance  probabilities  are  minimum 

Pm,t  —  Pm,s  f°r  a^  m  and  s  ^  t  (6) 

will  be  the  worst  source  of  information,  offering  the 
lowest  utility.  These  are  sufficient,  but  not  necessary, 
conditions  for  being  best  or  worst.  They  apply  very 
generally  to  any  application,  allowing  us  to  determine 
in  some  cases  whether  fusion  rises  to  the  top,  or  falls  to 
the  bottom  rank  among  the  information  sources  fused. 

For  simplicity,  we  continue  now  with  the  two-option 
inference,  m  =  0,1,  and  the  fusion  of  just  two  infor¬ 
mation  sources,  s  =  1,2,  demonstrating  fusion  when 
associations  are  unambiguous,  and  again  with  associa¬ 
tion  uncertainty  (a  probability  of  missassociation) . 

3  Fusion  with  association  uncer¬ 
tainty 

Fusion  generally  divides  into  two  types.  The  simplest, 
decision  fusion,  is  when  inferences  Is  made  on  the 
strength  of  each  information  source  s  independently 


Figure  4:  The  two-dimensional  extension  of  Fig. (2)  for 
the  fusion  of  two  sources  of  information. 

are  fused  together  into  a  single  inference  If,  typically 
using  a  logical  AND  or  OR  operation,  or  a  weighted 
voting  scheme  [3]  ,[4],  And  there  is  feature  fusion, 
in  which  the  decision  variables  cs  or  other  inference¬ 
determining  features  are  fused,  creating  in  effect  an¬ 
other  decision  variable  Cf  by  which  a  single  fused  in¬ 
ference  IF  is  made.  In  principle,  feature  fusion  offers 
higher  utility  gains,  though  often  not  dramatically  high 
in  practice  because  thorough  knowledge  of  the  proba¬ 
bility  distributions  (as  in  Fig. (2))  is  required  to  opti¬ 
mize  performance,  which  is  rarely  available  practice. 
Decision  fusion,  on  the  other  hand,  tends  to  be  more 
robust  in  practice. 

Here  we  use  a  simple  form  of  feature  fusion  for  il¬ 
lustration.  Assume  that  each  of  two  sources  for  fusion 
promise  equal  utility  Pff,  ,  =  Pff  2 ,  both  being  derived 
from  Gaussian  conditional  probability  distributions  of 
unit  variance  as  in  Fig.  (2),  with  separation  between 
means  of  d  =  2.0.  Equality  in  performance  probabil¬ 
ities  implies  equal  expected  utility  U  in  independent 
operation,  but  does  not  imply  equality  between  infer¬ 
ences  (i.e.  ci  need  never  equal  C2,  nor  I\  equal  J2).  As¬ 
sume,  moreover,  that  the  information  provided  by  each 
source  is  independent.  Then  the  joint  conditional  dis¬ 
tributions  of  decision  variables  (ci,  C2)  are  as  shown  in 
Fig.  (4),  which  is  a  two-dimensional  extension  of  Fig.  (2) 
for  two-source  fusion.  That  is  to  say,  given  an  occasion 
for  inference,  the  inference-determining  cues  in  each 
information  source  each  contribute  in  effect  some  net 
inference-determining  cue  that  has,  it  can  be  assumed, 
roughly  the  same  independent  determining  power  as 
when  the  two  sources  of  information  are  used  inde¬ 
pendently.  Indeed,  each  individual  source  contributes, 
in  effect,  its  decision  variable  cm  on  which  its  indepen¬ 
dent  performance  depends.  Each  occasion  for  inference 
therefore  generates  the  ordered  (ci,  C2) .  A  simple,  ad¬ 
mittedly  suboptimal  way  of  modeling  the  fusion  of  the 
two  sources  into  a  single  inference  is  again  to  threshold 
each  decision  variable  as  shown  in  Fig. (4).  If  (01,02) 
lies  in  quadrant  #1,  then  the  fusion  result  is  If  =  1; 
otherwise  Ip  =  0.  In  the  absence  of  association  er¬ 
rors  this  is  in  fact  equivalent  to  decision  fusion  with  a 
logical  AND  rule. 
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Figure  5:  A  scatter  plot  of  points  for  the  MonteCarlo 
simulation.  The  distributions  here  are  arranged  much 
like  those  shown  schematically  in  Fig.(xx).  750  points 
are  plotted  of  the  5000  used  in  the  simulations.  The 
states  m  =  0  and  1  appear  with  equal  frequency. 


Figure  (5)  shows  the  resulting  scatter  plot  when 
many  occasions  for  inference  are  modeled  using  Monte- 
Carlo  simulation,  when  the  two  real-world  possibilities 
to  =  0  and  1  occur  with  equal  frequency.  The  per¬ 
formance  probabilities  are  estimated  by  counting  the 
number  of  points  in  various  quadrants  of  the  (ci,C2) 
plane  created  by  the  thresholds  ct- 


source  1 
source  1 
source  2 
source  2 
fusion  1&2 
fusion  1&2 


Pqi  =  (points  in  quadrants  3,  4 )/n 

Pi  l  =  (points  in  quadrants  1,  2 )/n 

Pq  2  —  (points  in  quadrants  2,  3 )/n 

Pi  2  =  (points  in  quadrants  1,  4 )/n 

PqF  =  (points  in  quadrants  2,  3,  4 )/n 

Pip  =  (points  in  quadrant  1  )/n 


(7) 

in  which  n  is  the  total  number  of  points  in  the  scatter 
plot.  Figure  (6)  shows  the  result  P%  3  when  n  =  5000, 
and  simulation  is  repeated  times  with  different  values 
of  threshold  ct ■  The  simple  fusion  is  clearly  superior 
to  either  source  working  independently  because  its  per¬ 
formance  probabilities  are  to  the  right  and  above  that 
of  both  sources  working  independently  (the  condition 
(5)).  No  possibility  of  association  error  has  been  ad¬ 
mitted  here  because  the  subject  matter  treated  by  the 
two  sources  were  unambiguously  associated  by  design, 
in  the  way  that  the  ordered  pairs  (ci,C2)  were  con¬ 
structed:  one  pair  for  each  real-world  occasion  for  in¬ 
ference,  and  each  real-world  occasion  calling  for  just 
one  inference. 

Now  let  us  introduce  the  prospect  of  association  er¬ 
rors  by  assuming  that  there  exists  a  class  of  real-world 
situations  in  which  the  occasions  for  inference  come  in 


pairs.  This  might  happen  in  maritime  surveillance,  for 
instance,  when  a  scene  contains  two  ships,  A  and  B. 
This  potentially  leads  to  a  confusion  in  subject  mat¬ 
ter  between  the  two  information  sources  because  the 
two  contacts  in  one  sensor’s  view  may  be  associated  in 
two  different  ways  with  the  two  contacts  in  the  other 
sensor’s  view,  with  only  one  way  being  correct.  The 
probability  of  confusion  may  be  higher  the  closer  the 
two  ships  are  together,  for  instance.  Similar  confusion 
can  occur  with  radar  contacts  in  air  traffic  surveillance, 
if  an  attacking  fighter  jet  “hides”  very  close  to  a  civil¬ 
ian  jetliner,  which  may  be  their  preferred  method  of 
approach. 

Let  the  probability  of  missassociation  error  in  fu¬ 
sion  be  PM .  When  the  occasions  for  inference  come  in 
pairs  and  there  is  no  systematic  bias  0  <  PAI  <  1/2; 
the  upper  limit  being  random  guessing.  The  worst- 
case  guessing  condition  would  presumably  occur  if  the 
system  developer  never  considered  the  prospect  of  two 
very  close  occasions  for  decision/action  inferences.  To 
illustrate  this  condition,  another  MonteCarlo  simula¬ 
tion  was  run  much  as  before,  for  n  =  5000  occasions 
for  inferences  in  which  the  two  world  states  0  and  1  oc¬ 
cur  with  equal  relative  frequencies,  randomly  distrib¬ 
uted  throughout  the  entire  sampling  of  n  occasions. 
The  n  occasions  for  inferences  were  then  grouped  into 
n/2  pairs.  In  one  half  of  those  pairs  one  decision 
variable  was  flipped;  that  is,  given  the  pair  of  or¬ 
dered  pairs  ( Ci,C2 )  and  (cf,qf)  correctly  associated, 
they  were  replaced  by  a  corresponding  missassociated 
pair  (cf ,  c£)  and  (cj4,  cf )  .  In  effect,  subject  A  in  one 
source  was  missassociated  with  subject  matter  B  in 
the  other  in  one  half  of  the  n  occasions  for  inference. 
The  performance  curve  for  fusion  with  these  missasso- 
ciations  was  recomputed  and  plotted  in  Fig.  (6). 

Fusion  at  this  missassociation  rate  clearly  causes 
more  harm  than  good.  Its  performance  curve  lies  en¬ 
tirely  below  and  left  of  each  source  acting  indepen¬ 
dently  (the  condition  (6)).  The  fact  that  missassocia¬ 
tion  undermines  fusion  is  of  course  no  surprise.  And  it 
may  look  here  as  if  we  have  set  up  a  straw  man,  only 
to  knock  it  down  again,  deliberately  sabotaging  fusion 
in  order  to  illustrate  an  obvious  vulnerability.  The  in¬ 
tent,  however,  was  to  create  an  analytic  frame  work 
in  which  the  vulnerability  can  be  quantified  and  ana¬ 
lyzed,  without  wishfully  ignoring  it  away,  to  determine 
how  insidious  it  might  be. 

In  the  present  example,  if  no  more  than  two  con- 
fusable  subjects  can  be  expected,  then  the  fusion  per¬ 
formance  would  presumably  fall  somewhere  between 
that  with  perfect  association  PAI  =  0,  and  that  when 
pM  _  Yj 21  depending  on  the  relative  frequencies  with 
which  unambiguous  single  occasions  and  confusable 
paired  occasions  for  inference  appear  in  the  real  world. 
The  maximum  effective  missassociation  rate  PAI  that 
fusion  could  support  before  it  would  be  better  not 
to  use  it,  would  be  when  the  fusion  performance  just 
equals  the  performance  of  the  best  source,  which  in  this 
example  happens  when  PAI  «  0.20.  In  other  words,  if 
the  system  developer  has  reason  to  expect  confusion 
rates  of  PAI  =  0.20  or  higher,  it  would  be  better  not 
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Figure  6:  Performance  curves  for  the  two  sensors  work¬ 
ing  independently  (one  overlaying  the  other),  and  for 
fusion  without  and  with  association  errors.  Unambigu¬ 
ous  fusion  outperforms  the  individual  sensors  demon¬ 
strating  the  benefit  of  synthesis.  Fusion  with  missas¬ 
sociation  more  than  cancels  those  benefits. 

to  use  fusion. 

Association  typically  worsens  as  the  number  of  con- 
fusable  occasions  for  inference  increases.  For  L  con- 
fusables,  PM  <  (1  —  1  /L),  driving  the  performance 
curves  still  worse  than  plotted  in  Fig.  (6).  In  the  heat 
of  battle,  of  course,  the  number  of  confusables  may 
increase  dramatically  and  unexpectedly.  The  fusion 
performance  will  degrade  accordingly.  These  potential 
degradations  must  be  considered  and  identified  in  ad¬ 
vance  of  operations,  to  be  either  remedied  or  addressed 
with  other  contingency  plans. 


ways  to  reduce  association  errors:  either  by  1)  gath¬ 
ering  and  integrating  still  more  information  of  quite  a 
different  nature  than  that  being  fused,  namely,  meta 
information  about  situation  and  context,  to  drive  down 
the  probability  of  missassociation;  or,  possibly,  by  2) 
reducing  the  permutations  and  combinations  of  confus¬ 
ables  by  reducing  the  number  of  sources  of  information 
used  in  the  fusion.  The  present  analysis  may  allow  ex¬ 
perimentation  with  both  options. 

Ultimately  the  analysis  would  lead  (it  is  hoped)  to 
quantitative  engineering  rules  of  thumb  that  identify 
good  candidates  for  fusion,  and  identify  situations  in 
which  fusion  is  likely  to  fail  in  practice,  before  the  sys¬ 
tems  are  built.  These  design  rules  would  presumably 
relate  the  maximum  probability  of  missassociation  PM 
to  the  number  and  quality  of  the  individual  sources  of 
information  when  fusion  is  a  break-even  proposition. 
One  would  want  to  ensure  that  PAI  remained  much 
less  than  that  in  critical  operations. 
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4  Conclusions 

Association  uncertainties  are  the  norm  in  fusion,  espe¬ 
cially  for  complex  applications  at  critical  moments.  It 
is  only  a  question  of  the  degree.  The  treatment  given 
here  enters  into  that  question  quantitatively. 

A  generic  model  of  fusion  was  introduced  to  quan¬ 
titatively  demonstrate  the  competing  dynamics  that 
inevitable  occur  to  varying  degrees  in  fusion,  namely, 
the  increased  quality  of  information  due  to  the  com¬ 
bination  of  independent  sources  of  information,  versus 
the  decreased  quality  owing  to  association  uncertain¬ 
ties.  Vulnerability  to  missassociation  has  always  been 
recognized,  of  course,  but  its  impact  is  modeled  quan¬ 
titatively  now  within  a  generic  framework  that  admits 
further  analysis.  There  are,  for  instance,  perhaps  two 
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